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Abstract 

We investigate implementations of the Eisert-Wilkens-Lewenstein scheme of 
playing quantum games beyond strategic games. The scope of our research are 
decision problems, i.e., one-player extensive games. The research is based on the 
examination of their features when the decision problems are carried out via the 
EWL protocol. We prove that unitary operators can be adapted to play the role of 
strategies in decision problems with imperfect recall. Furthermore, we prove that 
unitary operators provide the decision maker possibilities that are inaccessible for 
classical strategies. 

PACS numbers: 02.50.Le, 03.67.Ac 

1 Introduction 

We present two new applications of the Eisert-Wilkens-Lewenstein (EWL) protocol pQ. 
The subject of applications are decision problems with imperfect recall. Two studied 
applications correspond to two main issues concerning such problems. The former deals 
with the problem of no outcome-equivalence between mixed and behavioral strategies 
that arises in games with imperfect recall. We prove that extending the set of actions 
to unitary operators may remove this non-equivalence. The latter part of our paper 
concerns the problem of payoff maximization in the well-known decision problem called 
the paradox of absentminded driver. We reexamine the unitary operators treated as 
actions in the EWL scheme applied to the paradox. We show that application of general 
unitary operators yields benefit to the decision maker. The strategy space SU(2) allows 
the decision maker to get payoffs that are inaccessible by any classical strategy. We 
also study the generahzed EWL protocol extended to more than two qubits and we 
demonstrate that some decision problems can be treated by this generalized scheme. 

2 Preliminaries to game theory 

Definitions in this section are derived from [2j and [3]. Readers who are not familiar 
with game theory are encouraged to get acquainted with these books. The main object 
that we are interested in is a decision problem. It is based on the formal definition of 
the extensive game [2] where only one player acts. We restrict this term as much as it 
is sufficient to be within the scope of our study. 
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Definition 2.1 A decision problem is a triple T = {H,u,I) where: 

1. H is a finite set of sequences called histories that satisfies the following two prop- 
erties: 

(a) The empty sequence ^ is a member of H . 

(b) If {ak)k=i,2,-,K e H and K > 1 then (afc)fe=i,2,..,i^-i e H- 

A history {ak)k=i,2,...,K & H is interpreted as a feasible sequence of actions taken 
by the decision maker. The history (oi, 02, . . . , G H is terminal if there is no 
(ai, 02, . . . , o/f, a) G H. The sets of nonterminal and terminal histories are denoted 
by D and Z respectively. The set of actions available to the decision maker after 
a nonterminal history h is defined by A{h) = {a: {h,a) G H}. 

2. u: Z ^ M is a utility function which assigns a number (payoff) to each of the 
terminal histories. 

3. The set of information sets, which is denoted by I, is a partition of D with the 
property that for all h, h' in the same cell of the partition A{h) = A{h'). Every 
information set li of the partition corresponds to the state of decision maker's 
knowledge. When the decision maker when makes move after certain history h 
belonging to li, she knows that the course of events of the decision problem takes 
the form of one of histories being part of this information set. She does not know, 
however, if it is the history h or the other history from It. 

The main method for describing decisions taken by a decision maker is based on planning 
actions before she starts with her first move. Every such plan is called a pure strategy: 

Definition 2.2 A pure strategy s is a function which assigns to every history h E D an 
element of A{h) with the restriction that if h and h' are in the same information set, 
then s{h) = s{h'). 

Let us denote by e{h) experience of the decision maker. It is the sequence of information 
sets and actions of the decision maker along the history h. According to [3j, a decision 
problem has imperfect recall if there exists an information set that contains histories h 
and h' for which e{h) 7^ e{h') i.e., a decision maker forgets some information about the 
succession of the information sets and (or) some of her own past moves that she knew 
earlier. 

The strategy set of a decision maker can be extended to random strategies. There 
are two ways of randomizing. One of them, known from strategic games, specifies 
probability distribution over the set of pure strategies and is called mixed strategy. The 
other specifies probability distribution over the actions available to decision maker at 
each information set: 

Definition 2.3 A behavioral strategy b is a function which assigns to every history 
h E D a probability distribution h{K) over A{h) such that h{h) = h{h') for any two 
histories h and h' which belong to the same information set. 

Since different randomization of strategies may imply the same utility payoff, a possi- 
bility to measure what result particular strategy produces is required: 
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Definition 2.4 Let mixed or behavioral strategy a in a decision problem be given. The 
outcome 0{a) of a is the probability distribution over the terminal histories induced by 
a. If two different strategies a and a' induce the same outcome then they are outcome- 
equivalent. 

The behavioral and mixed strategy ways of randomization are outcome-equivalent in 
decision problems (more generally in extensive games) with perfect recall. In problems 
with imperfect recall some outcomes may be obtained only through a mixed strategy 
or only through a behavioral strategy (see and 0). This issue will be studied in 
Section 4. 

3 EWL scheme for quantum 2x2 strategic game 

The generalized Eisert-Wilkens-Lewenstein scheme [1] is defined by the following com- 
ponents: 

1. an entangling operator J composed of the identity operator / and Pauli opera- 
tor ax'. 

J = ^{I ® I + iax®ax), (1) 
v2 

2. unitary operators Uj, j = 1, 2, from the space SU(2) of the form [B]: 

Uj{e,a,l3) = cos ^Aj + sin ^Bj for % G [0, vr] , (2) 
where Aj and Bj are defined as follows: 

Bj\0) = e^(f-ft)|l), Bj\l) = e*(§+^^)|0), for a, /3 G [0, 27r). 

3. a payoff function E(u) defined as expected value of a discrete random variable u 
with the values {uki G : k,l = 0,1} being payoffs associated with the outcomes 
of a classical bimatrix 2x2 game, and the probability distribution p^/ defined by 
KV'/l/cOP where {ipf) = J^(?7i ® U2)J\00) and {\kl)}k,i£{o,i} is the computational 
base of C2 ® C^: 

E{u){U,^U2)= Yl Uki\{^Pf\kl)\'. (3) 

k,ie{o,i} 

As operators Uj depend on parameters 9j,aj,f3j we will sometimes denote payoff 
function E{u){Ui ® U2) as E(m)(6'i, ai, f3i, 6*2, 02, /92)- If each terminal history h of 
a decision problem will be associated with some outcome Oh instead of some value 
of u{h) we will write E(0). 

In the EWL protocol two players select local operators from and each of them act 
on their own qubit initially prepared in the |0) state. For a more detailed description 
we encourage the reader to get acquainted with the prototype of the EWL scheme in [1] 
and other papers, for example [7], [8] where authors have investigated properties of the 
EWL protocol and compared them with the classical 2x2 game. 



3 




Figure 1: Examples of decision problems with imperfect recall. 

4 Decision problems with imperfect recall via EWL 
scheme 

The EWL scheme devised initially for a symmetric game Prisoner's Dilemma has already 
been used for 2x2 games with different properties in [7] and [9], and games with a bigger 
number of strategies available to players [10]. Let us consider player's knowledge in 
a strategic game at the moment of taking an action. It is the same as in the case of 
extensive games the property of which is that, players do not have a possibility to watch 
the opponents' move. Another similarity manifests itself, for instance, in a decision 
problem when a decision maker has forgotten the actions chosen in some of previous 
stages. Thus, such examples indicate a possibility of applying the EWL protocol to this 
type of games as well. Our aim is to adjust the EWL scheme to quantize two well-known 
decision problems with imperfect recall. 

4.1 Application 1 

The first example is taken from [2]. A decision maker is faced with a choice between 
two possibilities. When she makes a move, she has a choice of two actions once more. 
A significant feature of this problem is that before taking another action the decision 
maker forgets what action she has chosen previously. Therefore, this problem exhibits 
imperfect recall. The formal description {H, 0,1) of this example according to Definition 
12.11 (with a small substitution of the payoff function u by an outcome function O) is as 
follows: 

H = {0, ao, ai, (ao, &o), (oo, h), (oi, &o), h)}; 

0{akM)=Oku where /c,/G {0,1}, / = {0, {ao, ai}}. (4) 

The decision problem in a 'tree' language is shown in Figure [T^. As we have mentioned 
in preliminaries, the decision maker has two different ways to precise her decision - 
expressed as mixed strategies or as behavioral strategies. Her set of pure strategies is 
{ao^O) c^o^i) o-iboi o-ibi} where the first (second) entry of affii means an action taken by the 
decision maker when she is in the first (second) information set. Thus, she can choose 
mixed strategy as a probability distribution over akh. On the other hand the decision 
maker can specify an independent probability measure over actions available at each 
information set i.e., her behavioral strategy is of the form ((p, 1 — p), (g, 1 — g)). This 
example shows no outcome-equivalence between mixed and behavioral strategies. To be 
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precise, there are outcomes induced by some mixed strategies that are not achievable 
by any behavioral strategy. To see this, let us consider an outcome of the form piOoo + 
(1 — pi)oii, p G (0, 1). This outcome is obtained from a mixed strategy (pi, 0, 0, 1 — Pi). 
However, no behavioral strategy can yield this outcome. To see this, notice that any 
behavioral strategy ((p, 1 — p), {q, 1 — q)) must assign probability p{l — g) equal to 
yield the outcome piOoo + (1 — Pi)oii. It implies that the probability of obtaining either 
ooo or Oil is equal 0. 

Let us take a look how the EWL scheme can be applied to the problem described above. 
Looking at the game tree we can see that this problem has the same structure as an 
extensive form of a 2 x 2 strategic game. The decision maker before making her first 
move takes on the role of the player 1 and afterwards takes an action available for the 
player 2. As she has forgotten the action taken previously, she has the same knowledge 
of the game as players in 2 x 2 game. It is therefore natural to adapt to this problem 
the EWL scheme when the decision maker chooses some unitary operator Ui with which 
she acts on the first qubit |v?)i and subsequently applies a unitary operator U2 on the 
second qubit \(p)2- Then a payoff is calculated through the formula ([3]). The formal 
description of this problem in a manner comparable to (jl]) is as follows: 

H' = {(D,u,\^),,m^),,u2\^h)y, 

u' = Eiom®u2), i' = {mi},m2}}- (s) 

It should be emphasized that we do not try to identify each component from Definition 
12.11 with components of dSj). Specification takes on the informal character. It is 
aimed at organizing our deliberations. The main feature of the EWL scheme is that it 
comprises corresponding classical game i.e., there exists a set of unitary operators that 
yield the same outcomes as classical strategies. That is, classical actions can be realized 
via I and as it has been shown, for example, in P]. The decision maker's strategies 
are not single actions in the problem however. They are plans that describe what 
does the decision maker do in each of her two information sets (see Definition 12. 2p . i.e., 
what unitary action she performs on each of two qubits individually. Therefore, her set 
of pure classical strategies can be described as { J ® J, / ia^, icr^ ® /, icr^ ® icx}- Then 
a classical mixed strategy can be obtained by 

i/pTJ (g) / + ^/p^I iax + \fpzicyx ® / + ^fpi^Ox ® (6) 

where I^/Pip are the values of some probability mass function (notice that ^/p^ can take 
complex values). In general case a (pure) unitary strategy takes the form a\,fi\)® 
U2{92, 0(2, 1^2) ■ In games represented by bimatrices the equivalence (with respect to out- 
comes that can be achieved) between classical actions and some fixed unitary operators 
is sufficient to claim that a quantum realization generalizes the classical game. In ex- 
tensive games, particularly in the decision problem ([5]), it seems natural to find unitary 
strategies that realize classical behavioral strategies, not only mixed strategies. Follow- 
ing we know that a unitary strategy Ui{9i,0,0) ® [72(6*2,0,0) must imitate some 
classical move of the decision maker. This strategy corresponds exactly to classical be- 
havioral strategy ((p, 1 — p), (g, 1 — g)). If we assume p = cos^(6'i/2) and q = cos^(6'i/2), 
we obtain from ([3]): 




Since one-parameter operators Uj{6j, 0, 0) implement classical moves, a natural question 
arises: what is the role of wider range of unitary strategies in the decision problem (|5])? 
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The answer to this question is surprising: Extension of the set of behavioral strategies 
to the set SU(2) (8)SU(2) causes outcome-equivalence of behavioral strategies with mixed 
strategies. Notice that this problem is not trivial because there is no identity of the 
expression and t/i(6'i, ai, f3i) ® f/2(^2, «25 /32)- For example, if one puts ^/pi = ^/pl = 
and y/p2 = ^/p3 = 0, there is no representation of (E]) in the form of the tensor 
product of ([2]). On the other hand when we take, for example, ai = 02 7^ then the 
tensor product Ui{9i,ai, f3i) ® ^2(^2, ^25 /32) has not the form of a mixed strategy for 
any angles 6^1, 6*2, /32- However, the following statement is true: 

Proposition 4.1 For any mixed strategy of a decision maker in the decision problem 
^ there is an outcome-equivalent pure unitary strategy. 

Proof. The set of outcomes yielded by all mixed strategies is a convex hull of ele- 
ments {ooo; Ooi, Oio, Oil} due to the expression for a mixed strategy or, equivalently, 
a mixed strategy of the decision problem We will prove that any convex com- 
bination ;g{o 1} PfciOfci can be written as an expected outcome E{0){Ui U2) = 
ie{o i}'^ki\{i'f\kl)\'^ for some unitary operations Ui and U2 from SU(2). At first 
let us consider the case poo = Pn = or poi = Pio = 0. Then the convex com- 
bination Y.k,i(^{o,i}PkiOki is a segment poiOoi + PioOw or pooOoo + PuOu, respectively. 
Putting t/i(0,ai,0) ?72(0,0,0) we get E(0) = oqo cos^ ai + onsin^ai that is a seg- 
ment linking points oqo and on. Similarly, if we take Ui^n, 0, f3i) ® f/2(0, 0, 0) we obtain 
Ooi sin^ /3i + oiocos^/3i. Now, let us examine general convex combination of points Oki 
such that Poo + Pii and poi + Pio 0- The combination E(0) associated with 
Ui{9i, ai, /3i) ® U2{0, 0, 0) is of the form: 

9 9 
(oqo cos^ ai + Oil sin^ ai) cos^ — + (oqi sin^ f3i + Oio cos^ f3i) sin^ —. (8) 

Comparing the coefficients of the combination J2k /e{o 1} PkiOki and (jH]) we obtain the 
system of equations that has a unique solution: 

2 ^1 I 2 Poo 2 n Pio /^.N 

cos —=Poo+Pu, cos ai = ■ , cos (Ji = ■ . (9) 

2 Poo + Pu Poi + Pio 

The result (Q, together with the first case, finishes the proof. ■ 

Notice that the unitary strategies used in the proof depend only on an operation on the 
first qubit. Due to the fact that the qubits are maximally entangled every outcome can 
be obtained by performing an operation only on the first or only on the second qubit. 

4.2 Application 2 

The next example in which we are going to use the EWL scheme is based on [3] . As the 
previous example, this one also shows difference between reasoning based on mixed and 
behavioral strategies. The application is dealing with the well-known imperfect recall 
problem called the paradox of absentminded driver. Our research is not the first attempt 
to put this problem into quantum domain. The first one appeared in The authors 
of this paper presented the way of quantization with the use of the Marinatto and Weber 
scheme of playing quantum 2x2 games [12] - the initial state plays the main role. In 
outline, for many kinds of the absentminded driver problems various initial states are 
chosen to maximize the driver's payoff. Therefore, we expect that no other protocol 
could be ahead of [TT] in terms of maximization of the driver's payoff. However, the 
quantum version based on the EWL protocol turns out to be a convenient way to make 
an analysis of some complicated cases of the problem of absentminded driver. 
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4.2.1 The paradox of absentminded driver. 

The name of this decision problem is derived from a certain story describing this issue. 
An individual sitting for some time in a pub eventually decides to go back home. The 
way is leading through the motorway with two subsequent exits. The first exit leads to 
a catastrophic area (payoff 0). The choice of the other one will lead the decision maker 
home (payoff A > 2). If he continues his journey along the motorway not choosing 
any of the exits, he will not be able to go back home but he has a possibility to stay 
for the night at a motor lodge (payoff 1). The key determinant is the driver's absent- 
mindedness. This means that when he arrives at the exit he is not able to tell if it is 
the first or the second exit due to his absent-mindedness. This situation is described on 
Figure [T]d. The formal description is as follows: 

H = {0, ao, ai, (ai, ao), (oi, ai), }, / = {0, ai}; 

u(ao) = 0, u(ai,ao) = A, u{ai,ai) = l. (10) 

Let us determine decisions that the driver can make. Since the decision maker has just 
one information set, according to Definition \2.2\ only two pure strategies are available 
to him: 'exit' or 'motorway' with respective terminal histories oq and (oi, ai). Similarly, 
behavioral strategy of the driver will be represented by the same random device in each 
of the two nodes of the information set i.e., it is on the form {p, I — p) where p is the 
probability of choosing 'exit'. Notice first that the driver plans his journey still sitting 
in the bar which is equivalent to choosing some pure strategy. The optimal strategy is 
'motorway' (with corresponding payoff 1) which becomes paradoxical when the decision 
maker begins carrying out this pure strategy. It is better for him, when he approaches 
an exit, to go away from the motorway because he comes to a conclusion that with equal 
probability he is at the first or the second exit. Consequently, his optimal choice will be 
a certain behavioral strategy. For example if A = 4, the expected payoff corresponding 
to the strategy {p, I — p) is expressed hj u{p,l — p) = {1 — p){l + 3p). Maximizing 
u{p,l — p) we conclude that the optimal decision for the decision maker is to choose 
'exit' with probability 1/3 each time he encounters an intersection, which corresponds 
to the expected payoff 4/3. As in the previous example, here as well we can notice lack 
of equivalence between behavioral and mixed strategies. This time, however, behavioral 
strategy is strictly better than mixed one as it ensures strictly higher payoff for the 
driver. Observe, however, that condition A > 2 is essential for this case. Otherwise, 
p = maximizes the expected payoff u{p,l — p) which is equal 1. 

Now, we are going to implement the EWL protocol to this problem. It is possible 
since in the classical example we have again a decision problem with two stages. More- 
over, actions are taken independently at each of these stages as in the 2x2 bimatrix 
game. Further, each player in the 2x2 game has not any knowledge of an action taken 
by his opponent. Therefore, this is the same situation as if the decision maker was in the 
role of the player 1 and then the player 2, and he forgot his previous move. Let us assign 
the state after each action of the classical decision problem with the computational base 
of respective qubit. States induced by actions: 'exit' and 'motorway' available after 
history correspond to |0) and |1) states of the first qubit \(p)i. Similarly, we assign 
states after actions from A{ai) (see Definition 12. ip to base states of the second qubit 
\^p)2- Notice that this is the obvious procedure applied in quantum 2x2 bimatrix games 
where outcomes are assigned to base states \kl) where k,l G {0,1}. We assume, as in 
the classical case, that in the quantum realization the driver is unable to distinguish to 
which qubit he applies a unitary action. Therefore, the two qubits are in the informa- 
tion set. It implies that the same unitary operation U is applied to both qubits. More 
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formally: 



H' = {0, U\^h, iU\^h, U\cp)2)}, I' = {\^h, 1^)2}; 

E{u) (f/«2) ^ A|(V^/|10)p + and \^f) = Jtf/«V|00). (11) 

The core of the issue lies in the payoff function E(m). If state |0) on the first qubit 
is measured (which corresponds to choosing 'exit' at the first intersection), the payoff 
assigned to this state equals regardless of the state measured on the second qubit. 
Therefore, in the expected payoff E{u) includes (|(V'/|00)|2 + \ {^f\01)\'^). Thus, so 
defined quantum realization generalizes the classical case. The classical pure strategies 
can be again implemented by / and ia^ which correspond to 'exit' and 'motorway', 
respectively. These strategies imply operations and (iua;)®^ on both qubits that 
are indistinguishable by the decision maker, and produce outcomes equal to classical 
ones. Although we have assumed that the result on the first qubit determine payoff 
0, we always have to specify operations on both qubits. Like in the classical case, 
also here decision maker's strategy have to precise an action in every possible state 
of a decision problem. As in the previous example one-parameter operation U{6, 0, 0) 
matches classical behavioral strategy and we have: 



E{u){e) = X sin' ^ cos' ^ + 1 sin^ ^. 



(12) 



If we replace cos'(^/2) by p in the formula f ll2p we obtain expected payoff that corre- 
sponds to the classical behavioral strategy {p, 1—p) in the decision problem fllOj) . From 
the classical case ffTOl) we already know that the driver can obtain the maximal payoff 
which is 4/3 when A = 4, using operators of the type U{9, 0, 0). Let us investigate if the 
decision maker can benefit when the range of his actions is extended to any operator 
of the form ([2]). Assume that the driver has two-parameter set of unitary operations 
U{9, a, 0) at his disposal. Then the expected payoff is as follows: 



1 f 9 9 

E{u) (6*, a) = - A(sin 2a + 1) sin 6^ + f sin(2a) cos' - — sin' - 



(13) 



If the driver applies f/(7r/2, 7r/4, 0) to his both qubits, the expected payoff equals A/2. 
Coming back to the case A = 4, he gets 2 utilities instead of 4/3. Moreover, this is 
the highest payoff which the driver can guarantee himself by using any unitary opera- 
tions To prove this, we determine the final state \il)f) when any unitary operation 
U{9, a, /3)®' is given. The matrix representation of \ipf) takes the form: 



2 9 n 2 ^ 

COS 2a cos - + sin 2p sin - 
9,9 

cos - sm -(cos(a — p) + sm(a - 
9,9 

cos - sm -(cos(a — p) + sm(a - 
9 _ . 



V 



sin 2a cos 



cos 2/3 sin 



(14) 



2 ' 2 

Therefore, the state {ipf) is a particular case of the state Itp'jr) of the form: 

\ip'f) = ^ Vki\kl), where r]ki G C, ^ \r]ki\^ = 1 and r^oi = Vw- 
fc,/6{o,i} fc,ze{o,i} 



(15) 
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Figure 2: The n-tuple decision problem of absentminded driver. 

Exchanging \%ljf) by I'l/'j) in (fTT]) . we see that the expected payoff equals A|?7ioP + 

Furthermore, we have A > 2. It implies that the set argmax^^^ (E'(m)) consists of 
all points (/^ic^ii) for which ([r^ioP, l'?!!^) = (1/2,0). It is obvious that in the special 
case \ipf) the equality argmax^^(E(M)(6', a, /?)) = arg max^^(| (■?/'/ 1 10) p) is fulfilled as 
well. As we obtain max^^ (| 1 10) p) = 1/2, the decision maker can achieve maximal 
payoff equal to A/2. Observe that the maximal payoff that the decision maker can get 
in the classical case is A^/4(A — 1). This is strictly less than A/2 if only A > 2. This 
leads us to the conclusion that in the decision problem fITU]) extended to the quantum 
domain there exists the unitary strategy for any A > 2 that is strictly better than any 
classical one. 

4.2.2 The n-tuple paradox of absentminded driver. 

We have already showed the advantage of quantum strategies over classical ones in the 
problem of absentminded driver. Now, we test unitary strategies in the case where the 
driver comes across more than one treacherous intersection. At this moment we make an 
assumption that the absentminded driver problem is characterized by n + 1 intersections 
such that the first n intersections are treacherous ones (payoff when 'exit' is chosen at 
each of these intersections), and only one action 'exit' taken at n+1 intersection leads 
the driver home (payoff A). Choosing the action 'motorway' all the time yields payoff 
1, the same as in f ITU]) . This problem is depicted on Figure [2J A formal description of 
this classical case is similar to f lTU]) . To find an optimal classical strategy we have to 
maximize (1 — p)"'((A — l)p + 1) In quantum version we apply the general EWL protocol 
for (n + 1) X (n + 1) bimatrix game where entangling operator J and final state \il)f) 
take the form given in 

J = + ^af "+i), 1^/) = Jt^7®"+V|0)®"+^ (16) 

v2 

The decision maker carries out some fixed unitary operation U G SU(2) on each oi n + 1 
qubits. In addition, the qubits belong to the same information set: 

H' = {0, U\^)u (f/|y.)i, Ul^h), . . . , {U\^)u U\^)2, . . . , U\^)n+i)}; 

I' = {|^)l>IV^)2, •••,|</')n+l}; 

E(m) = A|(V'/|l)®"|0)p + |(^/|1)®"+^|2. (17) 

We identify the states after actions 'exit' and 'motorway' of the decision problem from 
Figure |2] with states |0) and |1), respectively, exactly like in (ITT]) . This causes the 
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equivalence between the decision problem ( [T7|) with unitary operators reduced to one- 
parameter operators U{6, 0,0) and the classical case, in the same way as in (fTOj) and 
(HH): 

E(^^) = A cos^ ^ sin^" ^ + sin2("+i) ^. (18) 

The n payoffs equal could suggest these ones are essential so that the EWL scheme 
can generalize the decision problem depicted in Figure El but this is not true. In fact, 
any decision problem given by the decision tree depicted in Figure [2] can be implemented 
by the EWL scheme. (Notice that f IlOp represents any decision problem given by the 
decision tree shown in Figure [T]d as it comes down to a problem with payoffs 0, A, 1 
through adding a respective constant to all the payoffs and (or) multiplying all the 
payoffs by a respective constant). To demonstrate that the EWL quantum representation 
defines correctly any kind of the n-tuple paradox, we assign, instead of fixed payoffs, 
outcomes Oi, 02, . . . , On+2 to the terminal histories. Then the n-tuple decision problem 
takes the form: 

H' = {0, ao, a,,..., (a}, a^^), (a}, <+i)}; 

/' = {0,ai,...,(ai,...,<,<+^)}; 

O(ao) = oi, 0{al,...,a\,al^^) = ot+i for t=l,2, ...,n. (19) 

Let us denote by j2, • • • , jn+i) G (S'n+i element of the computational base. 

For any t = 1,2, . . . ,n a symbol {jt,jt+i, ■ ■ ■ , jn+i)2 denotes the binary representation 
of a (decimal) number . 



Proposition 4.2 The decision problem ( [TTy with unitary operators U restricted to 
U{6,0,0) and expected payoff function E(0) extended to include: 



Y,Ot l(^/|l)''*"'|0)|x)|2 «;/iere x=(ji+i,ji+2,...,j„+i)2 (20) 

t=l x=0 

implements the decision problem [W\) . 

Proof. First we calculate f/(^, 0, 0)®"+V|0)®"+^ where J is defined by Then the 

expression y2f/®"+V|0)®"+i takes the form: 

U^y^ cos"-'^(^)+^ - sin'"(J') - - e-'-^y^ cos'^y^ - sin"-'^(^)+^ \y) (21) 

where the element r{y) depends on y = (ji, J25 • • • ,jn+i)2 and is given be the formula 
r{y) = ji + j2 + ■ ■ ■ + jn+i- Let us fix any element \y) from the computational base and 
determine the inner product {y\ipf)- To avoid laborious computation that are necessary 
to obtain complete form of the final state \ipf) we can choose the following simpler way 
to calculate the inner product. We take the bra vector J^. = {{y\ — i{y\)\/2 where 
1^) — '^x"''^^\y)- a language of matrices the element . is the y-th row of a matrix 
representation of . Next, let us put P as a label of a projector \y){y \ + \y){y\- Then 
{y\ipf) can be expressed as follows: 

{yltfjf) = Jt P^®n+ij|o)®"+i. (22) 
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Notice that r{y) and r{y) are connected through equation r{y) + r{y) = n+1. Using this 
fact and a result from (12T|) the amphtude associated with \y) of the state JlO)®""*"^ 
is given by 



1 ( fiv) eos-(^) ^ sin^(^) - - r-'-(^) cos^(^) - sin^(^) - 
v^V 2 2 2 2 



(23) 



In order to complete the state p^®"-+i 710)®"-+^ we just copy the amplitude of \y) from 
fl2Tl) . If we use f l22|) and fl23|) we will receive the final form of {y\ipf): 

(yl^jjj) = f(y) cos'^y) ^ siif^y^ ^. (24) 

The result together with substitution p = cos^(^/2) immediately gives us | ('?/'/ ly) P = 
p'^(y)(^l ^ pyiy) _ After some calculations we conclude from the last result that: 



The sum on the right-hand side of the equation fl25|) is the Newtons formula which is 
equal 1. Therefore, the formula f l25|l leads us to a conclusion that for any t = 1, 2, . . . , n 
the component assigned to Ot of the formula f l20|) can be expressed as: 



Y.m\ir-'mx)\' = {i-pY-'p. (26) 

X 

Equation f l26|) ends the proof as the right-hand side of the equation is a probability of the 
outcome Ot in the decision problem ( IT^ when a behavioral strategy (p, 1 —p) is taken. ■ 

It follows from Proposition 14.21 that every time when we concern the classical problem 
depicted on Figure [2] we can consider the problem (|T7|) when unitary operators (|2]) are 
restricted to one-parameter operators U {9, 0, 0) and the payoff function is given by (!T8|) . 

Let us return to example fll7p where a payoff function is fixed. Let us check if unitary 
operators ([2]) can yield strictly better results than any classical strategies applied to the 
problem. In order to do that, we need to determine the expected payoff E{u){U) defined 
in (dH) for any U{e,a,(3) e SU(2) . We can find the components |(?/'/|l)®"|0)|2 and 
I of E{u){U) with the use of equation fl22l) . After simple calculations we 
get: 

9 6 9 9 

E{u){U) = A|2 cos" - sin - sin(na — /9) + cos - sin" - cos(a — r;,/3)p 

+ l|cos"+i^sin[(n + l)a]+z"+isin"+i^cos[(n + l)/3]|2. (27) 

Comparing the payoffs f lTSl) and flTTI) we really ought to expect better results yielded by 
3-parameter operators. 

Example 4.3 For n = 3 and A = 20 the maximization result of [T8\) and ([^^ is as 
follows: 

classical scenario: maxE('u)(^^) ^ E{u) ^^q) ~ 2,46; 

(TT QtT StT 
— , — , — 
2 16 16 
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This example shows that an increased number of treacherous intersections does not 
reduce the abihty of the EWL scheme to yield benefit to the decision maker. Moreover, 
the ratio max^^^,/? E(m)(6', a, (3) : max^ E{u){9) grows together with A. For large n we had 
better use some mathematical software to determine the precise result of optimization. 
Following proposition assure us that the attempt of finding optimal solution makes sense 
for any n: 

Proposition 4.4 For any 1 in the n-tuple decision problem [T7\ ) there exist a num- 
ber Ao > 1, angles: 9' G (0, vr) and a',f3' G (0, 27r) such that for any payoff \' > Aq the 
unitary strategy U{9', a', (3') yields a payoff strictly higher than a payoff achieved by any 
classical strategy. 

Proof. We showed in subsection 4.2.1 the case when n = 1. So we now assume n ^ 2. 
Let us take the decision problem f lT7|) . For any n ^ 2 let us put Aq = (cos^" |- sin^ |")~^ 
and unitary operator U* {9' , a' , (3') defined by: 

^' = 2arccos^, = + ^-XA{n))n n + 2nxAn) _ 

^/^^+T 2(^2-1) ^ 2(^2-1) ^ ' 

where XaI^) is an indicator function of a. set A = {n: = —1}. Notice that Aq, 9', a' 
and (3' all meet the requirements of the proposition. They depend only on n. Further, Aq 
is well defined as 9' ^ {0, vr}. For any 5 > 1 let us denote A' = SXq the payoff associated 
with the problem (|T7|) . By putting parameters (l28i) into formula (127|) and comparing it 
to (|T8|) . and by using the fact that 9' G argmax^ (cos^ | sin^" |), we obtain the following 
sequence of inequalities: 

E{u){U*) ^ A' f cos^" - sin^ - + cos^ - sin^" = 6 + X' cos^ - sin^" - 
^^^^ V 22 2 2 J 2 2 

9' 9' 

> 1 + A' cos^ - sin^" - ^ max E{u) (9) , (29) 
2 2 6»e[o,7r] 

which completes the proof. ■ 

Let us observe now, how Proposition 14.41 concerns the result of quantization the absent- 
minded driver in subsection 4.2.1. A segment of numbers A in which there exists unitary 
strategy strictly better than classical one, is the segment (2, oo). It can be easily proved 
that for any A G M \ (2, oo) the maximal payoff of the decision problem fllU]) is equal 1 
regardless of used unitary strategies. Proposition 14.41 shows that the problem of finding 
optimal unitary strategy in decision problems with various numbers of the treacherous 
intersections is very much alike. 



5 Conclusion 

We have found the new use of the EWL protocol beyond strategic 2x2 games. It 
turns out once again that game theory defined on quantum domain provides results 
that are inaccessible in classical game theory. We have confirmed through Proposition 
14.41 that we can increase maximal payoff in decision problems carried out via the EWL 
scheme. Our research has allowed to formulate Proposition 14.11 that points out another 
peculiarity of quantum games. Unitary strategies ([2]) that include classical behavioral 
ones (when they are restricted to one-parameter operators) can be outcome equivalent 
to unitary operators implementing classical mixed strategies while behavioral and mixed 
strategies are not outcome equivalent in classical decision problems. These surprising 
features make quantum games worth further thorough studies. 
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